Abstract

We show that minimizers of convex functions subject to almost all linear perturbations are nondegenerate. An analogous result holds more generally, for lower-$C^2$ functions.

1 Introduction

In this work we study the nature of minimizers of "typical" convex functions. We model this question by considering a fixed extended-real-valued convex function $f$, and then studying properties of minimizers of the perturbed function $x \mapsto f_v(x) = f(x) - v^T x$ that hold for almost all values of the data vector $v \in \mathbb{R}^n$ (in the sense of Lebesgue measure).

Classical theory shows that, given a proper convex function $f$, the perturbed function $f_v$ typically has at most one minimizer. To see this, note first that we may assume $f$ is closed, since any minimizer of $f$ also minimizes its closure. Now we observe that the Fenchel conjugate $f^*$ is differentiable almost everywhere on the interior of its domain, by Rademacher's theorem (see for example [9, Theorem 9.60]), so for almost all vectors $v$, the subdifferential $\partial f^*(v)$ is either single-valued or empty. The result now follows, since this subdifferential coincides with the set $(\partial f)^{-1}(v)$, which is exactly the set of minimizers of $f_v$.
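The uniqueness statement above can be checked by hand on a toy case. The following is a minimal sketch, assuming the particular choice $f(x) = |x|$ on the real line and a finite grid in place of $\mathbb{R}$; the function names and the grid are illustrative and not part of the original text.

```python
def perturbed_abs(x, v):
    """Value of f_v(x) = |x| - v*x for the convex function f(x) = |x|."""
    return abs(x) - v * x


def grid_minimizers(v, half_width=10):
    """Return the grid points in [-5, 5] (step 0.5) where f_v attains its grid minimum."""
    grid = [0.5 * i for i in range(-half_width, half_width + 1)]
    values = [perturbed_abs(x, v) for x in grid]
    best = min(values)
    return [x for x, value in zip(grid, values) if value == best]


print(grid_minimizers(0.5))        # a single minimizer, x = 0
print(len(grid_minimizers(1.0)))   # a whole ray of minimizers when v = 1
```

In this example the minimizer is unique whenever $|v| < 1$, and uniqueness fails only at $v = \pm 1$, a set of measure zero. For $|v| > 1$ the function is unbounded below, so the grid minimum sits at the edge of the grid and has no meaning for the problem on all of $\mathbb{R}$.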

Our aim is to strengthen this classical result. Minimizers $x$ of the perturbed function $f_v$ are characterized by the property that the vector zero lies in the subdifferential $\partial f_v(x)$. We prove, for almost all vectors $v$, that the minimizer $x$ is not only unique, but also nondegenerate, by which we mean that zero lies in the relative interior of the subdifferential: $0 \in \operatorname{ri} \partial f_v(x)$ (or equivalently, the positive span $\mathbb{R}_+ \partial f_v(x)$ is a subspace). The proof, following an idea of [7], uses a result in geometric measure theory due to Larman [5].

As an example, consider the standard linear programming problem

$$\max_{x \in \mathbb{R}^n} \{ v^T x : a_i^T x \le b_i \ (i = 1, 2, \ldots, m) \}$$

for given vectors $a_i \in \mathbb{R}^n$ and scalars $b_i \in \mathbb{R}$. We can restate this problem as minimizing the perturbed function $f_v$ corresponding to the original function $f$ that takes the value zero on the feasible region and $+\infty$ elsewhere. Consider an optimal solution $x$ and the corresponding index set of active constraints, $I = \{i : a_i^T x = b_i\}$. Then we have

$$\partial f_v(x) = -v + \Big\{ \sum_{i \in I} \lambda_i a_i : \lambda_i \ge 0 \Big\},$$
$$\operatorname{ri} \partial f_v(x) = -v + \Big\{ \sum_{i \in I} \lambda_i a_i : \lambda_i > 0 \Big\}.$$



Thus the minimizer $x$ of $f_v$ is nondegenerate exactly when there exists a dual-feasible solution $\lambda \in \mathbb{R}^m$ satisfying strict complementary slackness. We hence recover the well-known fact that, for almost all objective functions, if a linear program has an optimal solution, then that solution is unique and furthermore corresponds to a strictly-complementary-slack dual solution.
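The nondegeneracy test for a linear program can be sketched on a tiny instance. The code below is an illustration only: it assumes a problem in $\mathbb{R}^2$ with exactly two linearly independent active constraints at the candidate vertex, so that the multipliers are unique and can be found by Cramer's rule. The helper names are hypothetical.

```python
def active_multipliers(a1, a2, v):
    """Solve lam1*a1 + lam2*a2 = v for linearly independent a1, a2 in R^2 (Cramer's rule)."""
    det = a1[0] * a2[1] - a1[1] * a2[0]
    if det == 0:
        raise ValueError("active constraint vectors must be linearly independent")
    lam1 = (v[0] * a2[1] - v[1] * a2[0]) / det
    lam2 = (a1[0] * v[1] - a1[1] * v[0]) / det
    return lam1, lam2


def classify_vertex(a1, a2, v):
    """Classify a vertex whose active constraints have normals a1 and a2."""
    lam1, lam2 = active_multipliers(a1, a2, v)
    if lam1 < 0 or lam2 < 0:
        return "not optimal"       # v is outside the cone generated by a1, a2
    if lam1 > 0 and lam2 > 0:
        return "nondegenerate"     # strict complementary slackness holds
    return "degenerate"            # some active constraint has a zero multiplier


# Feasible region {x1 <= 1, x2 <= 1}, candidate vertex (1, 1).
print(classify_vertex((1.0, 0.0), (0.0, 1.0), (1.0, 1.0)))
print(classify_vertex((1.0, 0.0), (0.0, 1.0), (1.0, 0.0)))
```

For the objective $v = (1, 1)$ both multipliers are positive, so the vertex is a nondegenerate minimizer of $f_v$. For $v = (1, 0)$ the second multiplier vanishes, the vertex is degenerate, and the whole edge $x_1 = 1$ is optimal. Such objectives lie on the boundary of the cone generated by the active normals, which is a Lebesgue null set.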

For convex functions, critical points (those at which zero is a subgradient) coincide with minimizers. For nonconvex functions, we can more generally consider nondegeneracy of critical points. It transpires that our result on typical nondegeneracy extends in particular to all lower-$C^2$ functions (those functions locally representable as sums of convex functions and quadratics). However, in more general contexts the result may fail. The classical generalization of the subdifferential of a convex function is the Clarke generalized gradient [3], but [2] presents a locally Lipschitz function $f : \mathbb{R} \to \mathbb{R}$ whose Clarke generalized gradient $\partial_c f$ at any point $x \in \mathbb{R}$ is the interval $[-x, x]$. In this case, the perturbed function $f_v$ has a degenerate critical point for every non-zero value of $v$.

2 Preliminaries 

2.1 Variational Analysis 

We recall some standard notions from variational analysis (see for example [9]). Consider the extended real line $\overline{\mathbb{R}} := \mathbb{R} \cup \{-\infty\} \cup \{+\infty\}$. We say that an extended-real-valued function is proper if it is never $-\infty$ and is not always $+\infty$.

For a function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$, we define the domain of $f$ to be

$$\operatorname{dom} f = \{x \in \mathbb{R}^n : f(x) < +\infty\},$$

and we define the epigraph of $f$ to be

$$\operatorname{epi} f = \{(x, r) \in \mathbb{R}^n \times \mathbb{R} : r \ge f(x)\}.$$

A function is convex when its epigraph is convex, and closed when its epigraph 
is closed. 

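Definition 2.1. Consider a set $S \subset \mathbb{R}^n$ and a point $\bar{x} \in S$. The regular normal cone to $S$ at $\bar{x}$, denoted $\hat{N}_S(\bar{x})$, consists of all vectors $v \in \mathbb{R}^n$ such that

$$\langle v, x - \bar{x} \rangle \le o(|x - \bar{x}|) \quad \text{for } x \in S,$$

where we denote by $o(|x - \bar{x}|)$ for $x \in S$ a term with the property that

$$\frac{o(|x - \bar{x}|)}{|x - \bar{x}|} \to 0$$

when $x \xrightarrow{S} \bar{x}$ with $x \ne \bar{x}$.

Definition 2.2. Consider a set $S \subset \mathbb{R}^n$ and a point $\bar{x} \in S$. The limiting normal cone to $S$ at $\bar{x}$, denoted $N_S(\bar{x})$, consists of all vectors $v \in \mathbb{R}^n$ such that there are sequences $x_r \xrightarrow{S} \bar{x}$ and $v_r \to v$ with $v_r \in \hat{N}_S(x_r)$.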



In the presence of convexity, normal cones have a much simpler form. 

Theorem 2.3. [9, Theorem 6.9] For a convex set $S \subset \mathbb{R}^n$ and a point $\bar{x} \in S$, the regular and the limiting normal cones coincide, and consist of all vectors $v \in \mathbb{R}^n$ such that

$$\langle v, x - \bar{x} \rangle \le 0 \quad \text{for all } x \in S.$$

Normal cones allow us to study geometric objects. We now define subd- 
ifferentials, which allow us to analyze behavior of functions. 

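Definition 2.4. Consider a function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ and a point $\bar{x} \in \mathbb{R}^n$ where $f$ is finite. The regular and the limiting subdifferentials of $f$ at $\bar{x}$, respectively, are defined by

$$\hat{\partial} f(\bar{x}) = \{v \in \mathbb{R}^n : (v, -1) \in \hat{N}_{\operatorname{epi} f}(\bar{x}, f(\bar{x}))\},$$
$$\partial f(\bar{x}) = \{v \in \mathbb{R}^n : (v, -1) \in N_{\operatorname{epi} f}(\bar{x}, f(\bar{x}))\}.$$

If the function $f$ is convex, both subdifferentials reduce to the classical convex subdifferential,

$$\{v \in \mathbb{R}^n : \langle v, x - \bar{x} \rangle \le f(x) - f(\bar{x}) \text{ for all } x \in \mathbb{R}^n\}.$$

Remark 2.5. For $\bar{x} \in \mathbb{R}^n$ where $f(\bar{x})$ is not finite, we follow the convention that $\hat{\partial} f(\bar{x}) = \partial f(\bar{x}) = \emptyset$. The regular and the limiting subdifferentials are always closed sets, and the regular subdifferential is convex.

Subdifferentials play the role of generalized gradients in the following sense.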

Theorem 2.6. [9, Exercise 8.8] Consider a function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ and a point $\bar{x} \in \mathbb{R}^n$. If $f$ can be written as $f = g + h$, where $g$ is finite at $\bar{x}$ and $h$ is $C^1$ smooth on a neighborhood of $\bar{x}$, then

$$\hat{\partial} f(\bar{x}) = \hat{\partial} g(\bar{x}) + \nabla h(\bar{x}),$$
$$\partial f(\bar{x}) = \partial g(\bar{x}) + \nabla h(\bar{x}).$$

Theorem 2.7. [9, Theorem 12.12, 12.17] Let $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ be a proper, convex function. Then on the set where the set-valued mapping $(I + \partial f)^{-1}$ takes nonempty values, it is single-valued and Lipschitz continuous with constant 1.



Remark 2.8. Theorem 2.7 is a special case of the celebrated theorem of Minty. See [6] or [9, Section 12.B] for more details.
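As a concrete instance of Theorem 2.7, one can take $f(x) = |x|$ on the real line, for which $(I + \partial f)^{-1}$ is the familiar soft-thresholding map. The sketch below assumes this specific example and checks the Lipschitz bound only on a finite sample of points, so it is an illustration rather than a proof. The function names are hypothetical.

```python
import math


def resolvent_abs(c):
    """Evaluate (I + df)^{-1}(c) for f(x) = |x|, i.e. solve c in x + d|x|(x)."""
    return math.copysign(max(abs(c) - 1.0, 0.0), c)


def is_nonexpansive(points):
    """Check |R(a) - R(b)| <= |a - b| for all pairs of sample points."""
    return all(
        abs(resolvent_abs(a) - resolvent_abs(b)) <= abs(a - b) + 1e-12
        for a in points
        for b in points
    )


samples = [0.25 * i for i in range(-20, 21)]
print(resolvent_abs(3.0), resolvent_abs(0.5), is_nonexpansive(samples))
```

Here $c = 3$ is mapped to $x = 2$, since $3 \in 2 + \partial f(2) = \{3\}$, while every $c \in [-1, 1]$ is mapped to $x = 0$, since $c \in 0 + [-1, 1]$.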

We now define a large and robust class of functions that includes both 
smooth functions and finite convex functions. 

Definition 2.9. [9, Theorem 10.33] A function $f : O \to \mathbb{R}$, where $O$ is an open set in $\mathbb{R}^n$, is said to be lower-$C^2$ on $O$ if for each point $x \in O$, there is a neighborhood around $x$ and a scalar $\rho$ such that on this neighborhood $f + \rho|\cdot|^2$ is a finite convex function.

By Theorem 2.6, the regular and limiting subdifferentials coincide for lower-$C^2$ functions.

Remark 2.10. To illustrate the abundance of lower-$C^2$ functions, consider the following example. Given $C^2$ functions $f_i : O \to \mathbb{R}$ on an open set $O \subset \mathbb{R}^n$ ($i = 1, \ldots, m$), the function $f = \max\{f_1, \ldots, f_m\}$ is lower-$C^2$ on $O$. For more details see [9, Chapter 10.F].

2.2 Hausdorff Measures 

For a set $U \subset \mathbb{R}^n$, let $\operatorname{diam} U$ denote its diameter, that is

$$\operatorname{diam}(U) = \sup_{x, y \in U} |x - y|.$$

Definition 2.11. Consider a set $S \subset \mathbb{R}^n$ and real numbers $\delta, d \ge 0$. We define

$$\lambda_d^{\delta}(S) = \inf \Big\{ \sum_{i=1}^{\infty} \operatorname{diam}(U_i)^d : S \subset \bigcup_{i=1}^{\infty} U_i, \ \operatorname{diam}(U_i) \le \delta \Big\}.$$

Observe that the infimum in the definition above is taken over all countable covers $\{U_i\}$ of $S$ such that $\operatorname{diam}(U_i) \le \delta$ for each $i$.

Definition 2.12. For a set $S \subset \mathbb{R}^n$, define the $d$-dimensional Hausdorff measure of $S$ to be

$$\lambda_d(S) = \lim_{\delta \downarrow 0} \lambda_d^{\delta}(S).$$

It can be shown that for each $d \ge 0$, the set function $\lambda_d$ is an outer measure on $\mathbb{R}^n$. Furthermore, if $d$ is a positive integer, then on Lebesgue measurable sets in $\mathbb{R}^d$ the $d$-dimensional Hausdorff measure is a rescaling of the $d$-dimensional Lebesgue measure. For more details, see [10]. The following is an easy consequence of the definition of Hausdorff measure.



Proposition 2.13. Consider a set $S \subset \mathbb{R}^n$ and let $f : S \to \mathbb{R}^m$ be a Lipschitz continuous mapping with Lipschitz constant $\kappa$. Then for any real number $d \ge 0$, we have $\lambda_d(f(S)) \le \kappa^d \lambda_d(S)$.

Corollary 2.14. Consider a set $S \subset \mathbb{R}^n$ and let $f : S \to \mathbb{R}^m$ be a locally Lipschitz mapping. Then for any real number $d \ge 0$, if $\lambda_d(S) = 0$ then $\lambda_d(f(S)) = 0$.

Proof. Around each point $x \in S$, consider a neighborhood on which $f$ is Lipschitz continuous. This collection of neighborhoods forms a cover of $S$, and hence there is a countable subcover, say $\{V_i\}$. By Proposition 2.13, for each index $i$ we have $\lambda_d(f(V_i)) = 0$, and hence

$$\lambda_d(f(S)) = \lambda_d\Big(\bigcup_{i=1}^{\infty} f(V_i)\Big) \le \lim_{n \to \infty} \sum_{j=1}^{n} \lambda_d(f(V_j)) = 0,$$

as claimed. □

Definition 2.15. Consider a compact, convex set $F \subset \mathbb{R}^n$. The set of maximizers $\operatorname{argmax}_{x \in F} \langle c, x \rangle$ is called the exposed face of the set $F$ corresponding to the vector $c$. In particular, the set $F$ is itself an exposed face (corresponding to $c = 0$). All other exposed faces are said to be proper.

For a convex set $S \subset \mathbb{R}^n$, we will denote its closure, relative interior, and relative boundary by $\operatorname{cl} S$, $\operatorname{ri} S$, and $\operatorname{rb} S$, respectively. To prove the main result, we will need the following two theorems.

Theorem 2.16. (Larman) [5] Let $S \subset \mathbb{R}^n$ be a compact convex set. Let $N$ be the union of the relative boundaries of all the proper exposed faces. Then $\lambda_{n-1}(N) = 0$.

Theorem 2.17. [2, Proposition 3] Suppose zero lies in the interior of the compact convex set $F \subset \mathbb{R}^n$. Then the proper exposed faces of the polar set $F^{\circ}$ are those sets of the form

$$G = \{c \in N_F(x) : \langle c, x \rangle = 1\},$$

for points $x$ on the boundary of $F$. Furthermore, any such exposed face has relative interior given by

$$\operatorname{ri} G = \{c \in \operatorname{ri} N_F(x) : \langle c, x \rangle = 1\}.$$



3 Main Result 

3.1 Subdifferentials of Convex Functions

The unit sphere in $\mathbb{R}^n$ will be denoted by $S^{n-1}$, and an open ball of radius $r$ around a point $x \in \mathbb{R}^n$ will be denoted by $B(x, r)$.

Lemma 3.1. Let $F \subset \mathbb{R}^n$ be a convex set. Then

$$\lambda_{n-1}\Big(\Big(\bigcup_{x \in F} \operatorname{rb} N_F(x)\Big) \cap S^{n-1}\Big) = 0.$$

Proof. Observe that $N_F(x) = N_{\operatorname{cl} F}(x)$ for $x \in F$, so it is sufficient to show that the statement of the lemma holds for a closed convex set $F$. First, let us consider the case when $F$ is a compact convex set. Without loss of generality, we can assume that zero is in the interior of $F$, since otherwise we can translate $F$ so as to have $0 \in \operatorname{ri} F$, and then consider $\mathbb{R}^n$ as the direct sum of the span of $F$ and its orthogonal complement. Define

$$G := \bigcup_{x \in F} \{c \in \operatorname{rb} N_F(x) : \langle c, x \rangle = 1\}.$$

Combining Theorems 2.16 and 2.17, we deduce $\lambda_{n-1}(G) = 0$. Observe that $G$ is contained in $\mathbb{R}^n \setminus \{0\}$. Now consider the mapping

$$f : \mathbb{R}^n \setminus \{0\} \to S^{n-1}, \qquad x \mapsto |x|^{-1} x.$$

The mapping $f$ is locally Lipschitz. Consequently, by Corollary 2.14 we have $\lambda_{n-1}(f(G)) = 0$. Observe that the image set $f(G)$ is contained in $\big(\bigcup_{x \in F} \operatorname{rb} N_F(x)\big) \cap S^{n-1}$, since $f$ simply scales each element of $G$. Now, to see the reverse inclusion, consider a vector $c \in (\operatorname{rb} N_F(\bar{x})) \cap S^{n-1}$ for some vector $\bar{x} \in F$. By definition of the normal cone, we have

$$\langle c, \bar{x} - x \rangle \ge 0 \quad \text{for all } x \in F.$$

In particular, since $0$ lies in the interior of $F$, we have $\langle c, \bar{x} \rangle > 0$. So we deduce $\hat{c} := |\langle c, \bar{x} \rangle|^{-1} c \in G$ and $f(\hat{c}) = c$. Thus we have shown

$$f(G) = \Big(\bigcup_{x \in F} \operatorname{rb} N_F(x)\Big) \cap S^{n-1},$$

and consequently

$$\lambda_{n-1}\Big(\Big(\bigcup_{x \in F} \operatorname{rb} N_F(x)\Big) \cap S^{n-1}\Big) = 0,$$

as we claimed.

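To get rid of the boundedness assumption on $F$, we will use a standard limiting argument. Assume that $F$ is a closed convex set that is not necessarily bounded. For a positive integer $k$, let $F_k = F \cap \bar{B}(0, k)$, where $\bar{B}(0, k)$ denotes the closed ball of radius $k$ around the origin. Since $N_{F_k}(x) = N_F(x)$ for every point $x \in B(0, k) \cap F$, we observe

$$\bigcup_{x \in B(0,k) \cap F} \operatorname{rb} N_{F_k}(x) \ \uparrow \ \bigcup_{x \in F} \operatorname{rb} N_F(x) \quad \text{as } k \to \infty.$$

Thus we have

$$\lambda_{n-1}\Big(\Big(\bigcup_{x \in F} \operatorname{rb} N_F(x)\Big) \cap S^{n-1}\Big) = \lim_{k \to \infty} \lambda_{n-1}\Big(\Big(\bigcup_{x \in B(0,k) \cap F} \operatorname{rb} N_{F_k}(x)\Big) \cap S^{n-1}\Big) \le \lim_{k \to \infty} \lambda_{n-1}\Big(\Big(\bigcup_{x \in F_k} \operatorname{rb} N_{F_k}(x)\Big) \cap S^{n-1}\Big) = 0,$$

where the final equality follows since $F_k = \bar{B}(0, k) \cap F$ is a compact convex set. □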

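We need the following simple proposition. For future reference, we let $\pi : \mathbb{R}^{n+1} \to \mathbb{R}^n$ be the canonical projection onto the first $n$ coordinates.

Proposition 3.2. Consider a convex function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ and a point $\bar{x} \in \mathbb{R}^n$. Then we have the relation

$$v \in \operatorname{rb} \partial f(\bar{x}) \iff (v, -1) \in \operatorname{rb} N_{\operatorname{epi} f}(\bar{x}, f(\bar{x})).$$

Proof. Let $K$ denote the normal cone $N_{\operatorname{epi} f}(\bar{x}, f(\bar{x}))$. If $\partial f(\bar{x}) = \emptyset$, then there is no $v \in \mathbb{R}^n$ such that $(v, -1) \in \operatorname{rb} K$, and hence the result holds trivially. Assume that $\partial f(\bar{x})$ is nonempty. Observe

$$\operatorname{ri} K \not\subset \{y \in \mathbb{R}^{n+1} : y_{n+1} \ge 0\},$$

since otherwise taking closures gives $y_{n+1} \ge 0$ for all $y \in K$ and hence we have $\partial f(\bar{x}) = \emptyset$, which is a contradiction. Thus there exists a point $y \in \operatorname{ri} K$ with $y_{n+1} < 0$. Since $K$ is a cone, we can rescale to get $y \in \operatorname{ri} K$ with $y_{n+1} = -1$. Hence

$$\operatorname{ri} K \cap \{y \in \mathbb{R}^{n+1} : y_{n+1} = -1\} \ne \emptyset.$$

Using [9, Proposition 2.42], we deduce that

$$(3.1) \qquad \operatorname{ri}\big(K \cap \{y \in \mathbb{R}^{n+1} : y_{n+1} = -1\}\big) = \operatorname{ri} K \cap \{y \in \mathbb{R}^{n+1} : y_{n+1} = -1\}.$$

Finally, we have

$$\operatorname{ri} \partial f(\bar{x}) = \pi\big(\operatorname{ri}(K \cap \{y \in \mathbb{R}^{n+1} : y_{n+1} = -1\})\big) = \{v : (v, -1) \in \operatorname{ri} K\},$$

where the last equality follows from (3.1). Taking complements, the result follows. □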

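Theorem 3.3. Let $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ be a convex function. Then the set

$$\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)$$

is Lebesgue null.

Proof. Let

$$H_{-1} := \{x \in \mathbb{R}^{n+1} : x_{n+1} = -1\},$$
$$H_{<} := \{x \in \mathbb{R}^{n+1} : x_{n+1} < 0\},$$
$$K := \Big(\bigcup_{x \in \operatorname{dom} f} \operatorname{rb} N_{\operatorname{epi} f}(x, f(x))\Big) \cap S^n \cap H_{<}.$$

Applying Lemma 3.1 to $\operatorname{epi} f$, we deduce $\lambda_n(K) = 0$. Consider the mapping

$$\phi : H_{<} \to H_{-1}, \qquad x \mapsto |x_{n+1}|^{-1} x,$$

which rescales each vector so that its last coordinate equals $-1$. Observe that $\phi$ is locally Lipschitz, and therefore by Corollary 2.14 we have $\lambda_n(\phi(K)) = 0$. From Proposition 3.2 we have

$$\pi \circ \phi(K) = \bigcup_{x \in \operatorname{dom} f} \operatorname{rb} \partial f(x).$$

Since $\pi$ is Lipschitz as well, we deduce $\lambda_n\big(\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)\big) = 0$. Since Hausdorff measures are Borel-regular [4, Section 2.10.2], the set $\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)$ is Lebesgue measurable and has Lebesgue measure zero. □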
Definition 3.4. Consider a convex function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$. A minimizer $\bar{x} \in \mathbb{R}^n$ of $f$ is said to be nondegenerate if it satisfies the property $0 \in \operatorname{ri} \partial f(\bar{x})$.

Corollary 3.5. Let $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ be a proper convex function. Consider the collection of perturbed functions $f_v(x) = f(x) - \langle v, x \rangle$, indexed by vectors $v \in \mathbb{R}^n$. Then for a full measure set of vectors $v \in \mathbb{R}^n$, the function $f_v$ has at most one minimizer, which furthermore is nondegenerate.

Proof. The uniqueness part of the claim is classical, as discussed in the introduction. Thus it is sufficient to show that for a full measure set of vectors $v \in \mathbb{R}^n$, every critical point of $f_v$ is nondegenerate. Indeed, we have $0 \in \operatorname{rb} \partial f_v(x) \iff v \in \operatorname{rb} \partial f(x)$. By Theorem 3.3, the set of vectors $v$ for which $v \in \operatorname{rb} \partial f(x)$ for some $x \in \mathbb{R}^n$ has Lebesgue measure zero, and so the result follows. □
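Corollary 3.5 can be made concrete on a simple example. The sketch below assumes the specific choice $f(x) = \|x\|_1$ on $\mathbb{R}^n$, for which $\partial f(0)$ is the cube $[-1, 1]^n$ and every other subdifferential is a face of that cube. The function name and the returned labels are illustrative only.

```python
def classify_l1_perturbation(v):
    """Classify minimizers of x -> ||x||_1 - <v, x> for a vector v (a list of floats)."""
    largest = max(abs(vi) for vi in v)
    if largest > 1:
        return "no minimizer"            # v outside the cube: f_v is unbounded below
    if largest < 1:
        return "unique, nondegenerate"   # x = 0, and v lies in the interior of the cube
    return "exceptional"                 # v on the boundary of the cube


print(classify_l1_perturbation([0.2, -0.7]))
print(classify_l1_perturbation([1.0, 0.3]))
print(classify_l1_perturbation([1.5, 0.0]))
```

In this example the exceptional perturbations are exactly those on the boundary of the cube, where $x = 0$ is a degenerate minimizer and a whole ray of minimizers appears. The boundary of the cube is Lebesgue null, in agreement with the corollary.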

3.2 Extension to lower-$C^2$ functions

Having proved Theorem 3.3, we can now easily extend this theorem to a nonconvex situation. In particular, we will shortly show that an analogous statement holds for all lower-$C^2$ functions.

Theorem 3.6. Consider a proper function $f : \mathbb{R}^n \to \overline{\mathbb{R}}$ with the property that for any point $x$ in its domain, there is a neighborhood $V$ around $x$ such that on $V$, the function $f$ admits the representation $f = g - \frac{\rho}{2}|\cdot|^2$, where $g$ is a convex function and $\rho$ is a positive real number. Then the set

$$\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)$$

is Lebesgue null.

Remark 3.7. In Theorem 3.6, unlike in the definition of lower-$C^2$ functions, the domain of $f$ is not required to be an open set and the convex function $g$ in the local representation of $f$ is not required to be finite.

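Proof. For each point $x \in \operatorname{dom} f$, consider the neighborhood guaranteed to exist by our assumption on $f$. This collection of neighborhoods is an open cover of the domain of $f$, and hence has a countable subcover, say $\{V_i\}$. Consider an arbitrary set $V_i$ from this cover. On $V_i$, we have $f = g - \frac{\rho}{2}|\cdot|^2$, and hence

$$(3.2) \qquad \bigcup_{x \in V_i \cap \operatorname{dom} f} \operatorname{rb} \partial f(x) = \bigcup_{x \in V_i \cap \operatorname{dom} f} \big(\operatorname{rb} \partial g(x) - \rho x\big) = \bigcup_{x \in V_i \cap \operatorname{dom} f} \big(\operatorname{rb}(\partial g(x) + x) - (\rho + 1)x\big).$$

Consider the map

$$H : \bigcup_{x \in V_i \cap \operatorname{dom} f} \operatorname{rb}(\partial g(x) + x) \to \bigcup_{x \in V_i} \operatorname{rb} \partial f(x), \qquad c \mapsto c - (\rho + 1)(\partial g + I)^{-1}(c).$$

In light of (3.2) and Theorem 2.7, the mapping $H$ is well-defined, surjective, and Lipschitz continuous. Observe

$$\lambda_n\Big(\bigcup_{x \in V_i \cap \operatorname{dom} f} \operatorname{rb}(\partial g(x) + x)\Big) = \lambda_n\Big(\bigcup_{x \in V_i \cap \operatorname{dom} f} \operatorname{rb} \partial\big(g(\cdot) + \tfrac{1}{2}|\cdot|^2\big)(x)\Big) = 0,$$

where the last equality follows from convexity of $g + \frac{1}{2}|\cdot|^2$ and Theorem 3.3. From the equation above and Corollary 2.14 we have $\lambda_n\big(\bigcup_{x \in V_i} \operatorname{rb} \partial f(x)\big) = 0$. Since Hausdorff measures are Borel-regular, the set $\bigcup_{x \in V_i} \operatorname{rb} \partial f(x)$ is Lebesgue measurable and has Lebesgue measure zero. Finally, since $\{V_i\}$ is a countable cover of $\operatorname{dom} f$, it easily follows from a limiting argument that $\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)$ is a Lebesgue null set, as was claimed. □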

Corollary 3.8. Let $f : O \to \mathbb{R}$ be a lower-$C^2$ function on an open set $O \subset \mathbb{R}^n$. Then the set

$$\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} \partial f(x)$$

is Lebesgue null.

Proof. From Definition 2.9, $f$ satisfies the conditions of Theorem 3.6, and hence the result follows. □

Definition 3.9. Let $f : O \to \mathbb{R}$ be a lower-$C^2$ function on an open set $O \subset \mathbb{R}^n$. We say that a point $x \in \mathbb{R}^n$ is critical for the function $f$ if $0 \in \partial f(x)$, and we call such a critical point $x$ nondegenerate if the stronger property $0 \in \operatorname{ri} \partial f(x)$ holds.

Corollary 3.10. Let $f : O \to \mathbb{R}$ be a lower-$C^2$ function on an open set $O \subset \mathbb{R}^n$. Consider the collection of perturbed functions $f_v(x) = f(x) - \langle v, x \rangle$, indexed by vectors $v \in \mathbb{R}^n$. Then for a full measure set of vectors $v \in \mathbb{R}^n$, every critical point of the function $f_v$ is nondegenerate.

Proof. We have $0 \in \operatorname{rb} \partial f_v(x) \iff v \in \operatorname{rb} \partial f(x)$. By Corollary 3.8, the set of vectors $v$ for which $v \in \operatorname{rb} \partial f(x)$ for some $x \in \mathbb{R}^n$ has Lebesgue measure zero, and so the result follows. □

4 A conjecture 



We can formulate Theorem 3.3 in terms of monotone set-valued mappings. See [9, Chapter 12] for the definitions. If we restrict our attention in the theorem to closed proper convex functions $f$, then Theorem 3.3 is equivalent to the statement that for a maximal cyclically-monotone mapping $F : \mathbb{R}^n \rightrightarrows \mathbb{R}^n$, the image of the set-valued map $x \mapsto \operatorname{rb} F(x)$ has Lebesgue measure zero (see [9, Theorem 12.25]). We make the following related conjecture.

Conjecture 4.1. Let $F : \mathbb{R}^n \rightrightarrows \mathbb{R}^n$ be a maximal monotone mapping. Then the image of the map $x \mapsto \operatorname{rb} F(x)$ has Lebesgue measure zero, that is, the set

$$\bigcup_{x \in \mathbb{R}^n} \operatorname{rb} F(x)$$

is Lebesgue null.



A proof of Conjecture 4.1, along with the techniques presented in this paper, might extend the nondegeneracy results of Section 3 to the class of "prox-regular" functions [8].